NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Summary statistics of learning link changing neural representations to behavior

https://doi.org/10.3389/fncir.2025.1618351

Zavatone-Veth, Jacob A; Bordelon, Blake; Pehlevan, Cengiz (August 2025, Frontiers in Neural Circuits)

How can we make sense of large-scale recordings of neural activity across learning? Theories of neural network learning with their origins in statistical physics offer a potential answer: for a given task, there are often a small set of summary statistics that are sufficient to predict performance as the network learns. Here, we review recent advances in how summary statistics can be used to build theoretical understanding of neural network learning. We then argue for how this perspective can inform the analysis of neural data, enabling better understanding of learning in biological and artificial neural networks.
more » « less
Free, publicly-accessible full text available August 29, 2026
Nadaraya–Watson kernel smoothing as a random energy model

https://doi.org/10.1088/1742-5468/ada49a

Zavatone-Veth, Jacob A; Pehlevan, Cengiz (January 2025, Journal of Statistical Mechanics: Theory and Experiment)

Abstract Precise asymptotics have revealed many surprises in high-dimensional regression. These advances, however, have not extended to perhaps the simplest estimator: direct Nadaraya–Watson (NW) kernel smoothing. Here, we describe how one can use ideas from the analysis of the random energy model (REM) in statistical physics to compute sharp asymptotics for the NW estimator when the sample size is exponential in the dimension. As a simple starting point for investigation, we focus on the case in which one aims to estimate a single-index target function using a radial basis function kernel on the sphere. Our main result is a pointwise asymptotic for the NW predictor, showing that it re-scales the argument of the true link function. Our work provides a first step towards a detailed understanding of kernel smoothing in high dimensions.
more » « less
Full Text Available
Learning curves for deep structured Gaussian feature models

https://doi.org/10.1088/1742-5468/ad642a

Zavatone-Veth, Jacob A; Pehlevan, Cengiz (October 2024, Journal of Statistical Mechanics: Theory and Experiment)

Abstract In recent years, significant attention in deep learning theory has been devoted to analyzing when models that interpolate their training data can still generalize well to unseen examples. Many insights have been gained from studying models with multiple layers of Gaussian random features, for which one can compute precise generalization asymptotics. However, few works have considered the effect of weight anisotropy; most assume that the random features are generated using independent and identically distributed Gaussian weights, and allow only for structure in the input data. Here, we use the replica trick from statistical physics to derive learning curves for models with many layers of structured Gaussian features. We show that allowing correlations between the rows of the first layer of features can aid generalization, while structure in later layers is generally detrimental. Our results shed light on how weight structure affects generalization in a simple class of solvable models.
more » « less
Full Text Available
Long sequence Hopfield memory

https://doi.org/10.1088/1742-5468/ad6427

Tahir_Chaudhry, Hamza; Zavatone-Veth, Jacob A; Krotov, Dmitry; Pehlevan, Cengiz (October 2024, Journal of Statistical Mechanics: Theory and Experiment)

Abstract Sequence memory is an essential attribute of natural and artificial intelligence that enables agents to encode, store, and retrieve complex sequences of stimuli and actions. Computational models of sequence memory have been proposed where recurrent Hopfield-like neural networks are trained with temporally asymmetric Hebbian rules. However, these networks suffer from limited sequence capacity (maximal length of the stored sequence) due to interference between the memories. Inspired by recent work on Dense Associative Memories, we expand the sequence capacity of these models by introducing a nonlinear interaction term, enhancing separation between the patterns. We derive novel scaling laws for sequence capacity with respect to network size, significantly outperforming existing scaling laws for models based on traditional Hopfield networks, and verify these theoretical results with numerical simulation. Moreover, we introduce a generalized pseudoinverse rule to recall sequences of highly correlated patterns. Finally, we extend this model to store sequences with variable timing between states’ transitions and describe a biologically-plausible implementation, with connections to motor neuroscience.
more » « less
Full Text Available
Partial observation can induce mechanistic mismatches in data-constrained RNNs

Qian, William; Zavatone-Veth, Jacob A; Ruben, Benjamin S; Pehlevan, Cengiz (October 2024, The First Workshop on NeuroAI @ NeurIPS2024)

Full Text Available
In-Context Learning by Linear Attention: Exact Asymptotics and Experiments

Lu, Yue; Letey, Mary; Zavatone-Veth, Jacob A; Maiti, Anindita; Pehlevan, Cengiz (October 2024, NeurIPS 2024 Workshop on Mathematics of Modern Machine Learning (M3L), 2024)

Full Text Available
Partial observation can induce mechanistic mismatches in data-constrained models of neural dynamics

Qian, William; Zavatone-Veth, Jacob A; Ruben, Benjamin S; Pehlevan, C (September 2024, Advanced in Neural Information Processing Systems (NeurIPS) 2024)

Full Text Available
Learning Curves for Deep Structured Gaussian Feature Models

Zavatone-Veth, Jacob A; Pehlevan, Cengiz (September 2023, Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available
Long Sequence Hopfield Memory

Chaudhry, Hamza T; Zavatone-Veth, Jacob A; Krotov, Dmitry; Pehlevan, Cengiz (October 2023, Associative Memory & Hopfield Networks in 2023)

Full Text Available
Long Sequence Hopfield Memory

Chaudhry, Hamza T; Zavatone-Veth, Jacob A; Krotov, Dmitry; Pehlevan, Cengiz (September 2023, Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available

« Prev Next »

Search for: All records